26 research outputs found

    Identity in research infrastructure and scientific communication: Report from the 1st IRISC workshop, Helsinki Sep 12-13, 2011

    Motivation for the IRISC workshop came from the observation that identity and digital identification are increasingly important factors in modern scientific research, especially with the now near-ubiquitous use of the Internet as a global medium for dissemination and debate of scientific knowledge and data, and as a platform for scientific collaborations and large-scale e-science activities.

The 1 1/2 day IRISC2011 workshop sought to explore a series of interrelated topics under two main themes: i) unambiguously identifying authors/creators & attributing their scholarly works, and ii) individual identification and access management in the context of identity federations. Specific aims of the workshop included:

• Raising overall awareness of key technical and non-technical challenges, opportunities and developments.
• Facilitating a dialogue, cross-pollination of ideas, collaboration and coordination between diverse – and largely unconnected – communities.
• Identifying & discussing existing/emerging technologies, best practices and requirements for researcher identification.

This report provides background information on key identification-related concepts & projects, describes workshop proceedings and summarizes key workshop findings.

    Compound heterozygous mutations in UBA5 causing early-onset epileptic encephalopathy in two sisters.

    Epileptic encephalopathies are a group of childhood epilepsies that display high phenotypic and genetic heterogeneity. The recent, extensive use of next-generation sequencing has identified a large number of genes in epileptic encephalopathies, including UBA5, in which biallelic mutations were first described as pathogenic in 2016 (Colin E et al., Am J Hum Genet 99(3):695-703, 2016; Muona M et al., Am J Hum Genet 99(3):683-694, 2016). UBA5 encodes an activating enzyme for a post-translational modification mechanism known as ufmylation, and is the first gene from the ufmylation pathway to be linked to disease. We sequenced the genomes of two sisters with early-onset epileptic encephalopathy along with their unaffected parents in an attempt to find a genetic cause for their condition. The sisters, born in 2004 and 2006, presented with infantile spasms at six months of age, which later progressed to recurrent, treatment-resistant seizures. We detected a compound heterozygous genotype in UBA5 in the sisters, a genotype not seen elsewhere in an Icelandic reference set of 30,067 individuals nor in public databases. One of the mutations, c.684G>A, is a paternally inherited exonic splicing mutation, occurring at the last nucleotide of exon 7 of UBA5. The mutation is predicted to disrupt the splice site, resulting in loss-of-function of one allele of UBA5. The second mutation is a maternally inherited missense mutation, p.Ala371Thr, previously reported as pathogenic when in compound heterozygosity with a loss-of-function mutation in UBA5 and believed to produce a hypomorphic allele.
Supportive of this, we have identified three adult Icelanders homozygous for the p.Ala371Thr mutation who show no signs of neurological disease. We describe compound heterozygous mutations in the UBA5 gene in two sisters with early-onset epileptic encephalopathy. To our knowledge, this is the first description of mutations in UBA5 since the initial discovery that pathogenic biallelic variants in the gene cause early-onset epileptic encephalopathy. We further provide confirmatory evidence that p.Ala371Thr is a hypomorphic mutation, by presenting three adult homozygotes who show no signs of neurological disease.

    Finding and sharing: new approaches to registries of databases and services for the biomedical sciences

    The recent explosion of biological data and the concomitant proliferation of distributed databases make it challenging for biologists and bioinformaticians to discover the best data resources for their needs, and the most efficient way to access and use them. Despite a rapid acceleration in uptake of syntactic and semantic standards for interoperability, it is still difficult for users to find which databases support the standards and interfaces that they need. To solve these problems, several groups are developing registries of databases that capture key metadata describing the biological scope, utility, accessibility, ease-of-use and existence of web services allowing interoperability between resources. Here, we describe some of these initiatives including a novel formalism, the Database Description Framework, for describing database operations and functionality and encouraging good database practice. We expect such approaches will result in improved discovery, uptake and utilization of data resources. Database URL: http://www.casimir.org.uk/casimir_dd

    COPA syndrome in an Icelandic family caused by a recurrent missense mutation in COPA

    Background: Rare missense mutations in the gene encoding coatomer subunit alpha (COPA) have recently been shown to cause autoimmune interstitial lung, joint and kidney disease, also known as COPA syndrome, under a dominant mode of inheritance. Case presentation: Here we describe an Icelandic family with three affected individuals over two generations with a rare clinical presentation of lung and joint disease and a histological diagnosis of follicular bronchiolitis. We performed whole-genome sequencing (WGS) of the three affected as well as three unaffected members of the family, and searched for rare genotypes associated with disease using 30,067 sequenced Icelanders as a reference population. We assessed all coding and splicing variants, prioritizing variants in genes known to cause interstitial lung disease. We detected a heterozygous missense mutation, p.Glu241Lys, in the COPA gene, private to the affected family members. The mutation occurred de novo in the paternal germline of the index case and was absent from 30,067 Icelandic genomes and 141,353 individuals from the Genome Aggregation Database (gnomAD). The mutation occurs within the conserved and functionally important WD40 domain of the COPA protein. Conclusions: This is the second report of the p.Glu241Lys mutation in COPA, indicating the recurrent nature of the mutation. The mutation was reported to co-segregate with COPA syndrome in a large family from the USA with five affected members, and classified as pathogenic. The two separate occurrences of the p.Glu241Lys mutation in cases and its absence from a large number of sequenced genomes confirm its role in the pathogenesis of COPA syndrome.

    A homozygous loss-of-function mutation leading to CYBC1 deficiency causes chronic granulomatous disease

    Mutations in genes encoding subunits of the phagocyte NADPH oxidase complex are recognized to cause chronic granulomatous disease (CGD), a severe primary immunodeficiency. Here we describe how deficiency of CYBC1, a previously uncharacterized protein in humans (C17orf62), leads to reduced expression of NADPH oxidase’s main subunit (gp91phox) and results in CGD. Analyzing two brothers diagnosed with CGD, we identify a homozygous loss-of-function mutation, p.Tyr2Ter, in CYBC1. Imputation of p.Tyr2Ter into 155K chip-genotyped Icelanders reveals six additional homozygotes, all with signs of CGD, manifesting as colitis, rare infections, or a severely impaired PMA-induced neutrophil oxidative burst. Homozygosity for p.Tyr2Ter consequently associates with inflammatory bowel disease (IBD) in Iceland (P = 8.3 × 10^-8; OR = 67.6), as well as reduced height (P = 3.3 × 10^-4; −8.5 cm). Overall, we find that CYBC1 deficiency results in CGD characterized by colitis and a distinct profile of infections indicative of macrophage dysfunction. We wish to thank the family of the two probands, as well as all the other individuals who participated in the study and whose contribution made this work possible. Peer Reviewed

    Database federation, resource interoperability and digital identity, for management and exploitation of contemporary biological data

    Modern research into the genetic basis of human health and disease is increasingly dominated by high-throughput experimentation and routine generation of large volumes of complex genotype to phenotype (G2P) information. Efforts to effectively manage, integrate, analyse and interpret this wealth of data face substantial challenges. This thesis discusses informatics approaches to addressing some of these challenges, primarily in the context of disease genetics. The genome-wide association study (GWAS) is widely used in the field, but translation of findings into scientific knowledge is hampered by heterogeneous and incomplete reporting, restrictions on sharing of primary data, publication bias and other factors. The central focus of the work was design and implementation of a core informatics infrastructure for centralised gathering and presentation of GWAS results. The resulting open-access HGVbaseG2P genetic association database and web-based tools for search, retrieval and graphical genome viewing increase overall usefulness of published GWAS findings. HGVbaseG2P conceptual modelling activities were also merged into a collaborative standardisation effort with international partners. A key outcome of this joint work is a minimal model for phenotype data which, together with ontologies and other standards, lays the foundation for a federated network of semantically and syntactically interoperable, distributed G2P databases. Attempts to gather complete aggregate representations of primary GWAS data into HGVbaseG2P were largely unsuccessful, chiefly due to concerns over re-identification of study participants. This led to a separate line of inquiry which explored (via in-depth field analysis, workshop organisation and other community outreach activities) potential applications of federated identity technologies for unambiguously identifying researchers online.
Results suggest two broad use cases for user-centric researcher identities, i) practical, streamlined data access management and ii) tracking digital contributions for the purpose of attribution, both of which are critical to facilitating and incentivising sharing of GWAS (and other) research data.

    The SNP Consortium website: past, present and future

    The SNP Consortium website (http://snp.cshl.org) has undergone many changes since its initial conception three years ago. The database back end has been changed from the venerable ACeDB to the more scalable MySQL engine. Users can access the data via gene or single nucleotide polymorphism (SNP) keyword searches and browse or dump SNP data to text files. A graphical genome browsing interface shows SNPs mapped onto the genome assembly in the context of externally available gene predictions and other features. SNP allele frequency and genotype data are available via FTP download and on individual SNP report web pages. SNP linkage maps are available for download and for browsing in a comparative map viewer. All software components of the data coordinating center (DCC) website (http://snp.cshl.org) are open source.